From statistical methods to deep learning, automatic keyphrase prediction: A survey
نویسندگان
چکیده
Keyphrase prediction aims to generate phrases (keyphrases) that highly summarizes a given document. Recently, researchers have conducted in-depth studies on this task from various perspectives. In paper, we comprehensively summarize representative the perspectives of dominant models, datasets and evaluation metrics. Our work analyzes up 167 previous works, achieving greater coverage than surveys. Particularly, focus deep learning-based keyphrase prediction, which attracts increasing attention in recent years. Afterwards, conduct several groups experiments carefully compare models. To best our knowledge, is first attempt these models using identical commonly-used metric, facilitating analyses their disadvantages advantages. Finally, discuss possible research directions future.
منابع مشابه
Nonresponse prediction in an establishment survey using combination of statistical learning methods
Nonrespose is a source of error in the survey results and national statistical organizations are always looking for ways to control and reduce it. Predicting nonrespons sampling units in the survey before conducting the survey is one of the solutions that can help a lot in reducing and treating the survey nonresponse. Recent advances in technology and the facilitation of complex calculations...
متن کاملAutomatic Keyphrase Extraction based on NLP and Statistical Methods
In this article we would like to present our experimental approach to automatic keyphrase extraction based on statistical methods and Wordnet-based pattern evaluation. Automatic keyphrases are important for automatic tagging and clustering because manually assigned keyphrases are not sufficient in most cases. Keyphrase candidates are extracted in a new way derived from a combination of graph me...
متن کاملMulti-View Representation Learning: A Survey from Shallow Methods to Deep Methods
Recently, multi-view representation learning has become a rapidly growing direction in machine learning and data mining areas. This paper first reviews the root methods and theories on multi-view representation learning, especially on canonical correlation analysis (CCA) and its several extensions. And then we investigate the advancement of multi-view representation learning that ranges from sh...
متن کاملRetrieval Term Prediction Using Deep Learning Methods
This paper presents methods to predict retrieval terms from relevant/surrounding words or descriptive texts in Japanese by using deep learning methods, which are implemented with stacked denoising autoencoders (SdA), as well as deep belief networks (DBN). To determine the effectiveness of using DBN and SdA for this task, we compare them with conventional machine learning methods, i.e., multi-la...
متن کاملA Supervised Learning Approach for Automatic Keyphrase Extraction
Keyphrases, synonymously spoken as keywords, represent semantic metadata and play an important role to capture the main theme represented by a large text data collection. Although authors provide a list of about five to ten keywords in scientific publications that are used to map them to respective domains, due to exponential growth of non-scientific documents either on the World Wide Web or in...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Information Processing and Management
سال: 2023
ISSN: ['0306-4573', '1873-5371']
DOI: https://doi.org/10.1016/j.ipm.2023.103382